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ABSTRACT 

Essay questions and the effects of their use on 
student scores are examined by a teacher through collaborative action 
research. Collaboration with the teacher's graduate school faculty 
resulted in refinements to the research question and development of a 
practical study methodology. During the two years of the study, the 
teacher taught four science classes of college-bound high school 
students of average and above average ability and four classes of 
science for vocational students of average and below average academic 
ability. In any one term, he would use essay questions in two classes 
and not use them in the other two. A total of 77 college-bound 
students and 97 vocational students completed the study. In general, 
essay questions, when used, accounted for about 25% of the overall 
test score. Time of day the class was held had no effect on student 
achievement, and, as expected, the classes that took the same 
textbook-supplied objective tests had similar average scores and 
attitudes about testing. When essay tests were used, the 
college-bound students' test scores went up almost 10% overall, with 
increased interest in studying for tests and increased motivation for 
learning the material. In the vocational group, the overall average 
test scores decreased by over 10%, with 5 students decreasing their 
scores for every one that increased his or her score. Students 
expressed dislike for the essay questions and tended to leave them 
unanswered. Additional research is necessary regarding the use of 
essay questions with below-average students. (Contains 51 
references .) (SLD) 
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Essay Questions: Another Look 

Teadier inquiri^ i^vide ^^uable inf(»mation fcx* all teadiers, but middle 
grades teadim es^)ecially benefit fixxn them (Stevenson, 1988). Middle school 
students are undergoing sudi complex development that our teadung techniques 
are rarely d>le to keq> up with their r{q>idly changing educatimial requirements, no 
less their emotional needs. The need to evaluate student learning is surpassed (Hily 
byaneedtoe^^uatehowwe, asteadi^areixovidingfiiatleaming. Our 
inqxessions of fins process are normally limited by the more or less indifferent 
obsorations we make in our daily teaching activities. Teacher inquiry dlows us to 
go beyond superficial inq)ressionsto gain a better understanding of what we are 
aurently acconylidiir^andto evaluate the effectiveness of new ideas and 
strategies we might use to improve as teadiers. Hie purpose ofthis paper is to 
discuss one form of teadier inquiry termed "collabcxative action researdi'' and a 
teacher intpiiiy project in idiich fiiis teacher took another lode at essay questions 
thrxxigh coUalxMrative actim research. 

Background 

F<ar many of us intoested in the contimied inqvovonent of educatiem, tl» 
strategies involved in what is often referred to as "action researdi" hold much 
prcmiise fix’ improving the relationshqis between theory, researdi, aiul practice. 
There are a variety of definitions for action researdi, but for file purposes of fiiis 
discussion it refos to research conducted in a classromn setting and involving the 
teacher and students native to fiiat setting. The addition of the word "collaborative” 
implies that the teadier is {novided assistance fiom cMitside sources to work with 
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those involved in the researdi in the solution of {nx^lems of {xuctice. In its 
simplest form, the research involves the i^ntification of an issue to be researdied 
by tte participants, the formaticm of a plan of ”acti<ni" in which to resolve the issue, 
the collection of data in various forms to substantiate die affects of die action, and 
reflecticm uptm the results of the action. The cycle may be repeated, using the 
information gained to alter the issue to be researched. 

Many different models of action research have been tried and are discussed 
in the literature (Whitford, Schlechty, & Shelor, 1989; Hollingsworth, 1992). 

While differences in af^iroaches exist, two themes tend to unify the various f(»ms 
of collaborative action research. One theme concerns the relationship between 
reflection and action as an cmgoing model of particqiant developmod. The second 
theme oiqdiasizes collaboration as a "means” of linking reflection and action. In 
most instances, the collalxarating partners are teadi^ re{»iesenting the practical 
WOTld of action and practice, and university researchers, refxesenting the reflective 
w<n1d of theory and inquiry. Collaborative acticm research rejnes^its a renaissance 
within educational research (Oja & Pine, 1987). 

Literature Review 

Hie use of action-oriented teacher researdi as a strategy fw increasing 
teachm* motivation toward reflective teaching practice and self inqnt>vement is not 
new. John Dewey (1933) published works pnnnoting teachers as a valuable 
resource in the researdi enviixmment Kurt Lewin (1947) is most often cited as the 
founder of action researdi since he referenced the "actions” resulting from group 
dynamics. His researdi models were aj^lied to the field of educati<m by mid- 
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century and e?q)eriaiced teadiers throug^iout the country were being ^icouraged to 
participate in various kinds of action research (Cory, 1953; Shumsky, 1958). The 
idea of a "collatxM^ve" effort was d^cmstrated by Corey and ejq)anded upcm by 
Schaefer (1967). 

About the same time, a ^itish inoject, ran fr(Mn 1967 to 1972, was 

initiated by the Humanities Curriculum Project under the direction of Lawrence 
Stenhouse. Stoihouse (1979, 1985) extoided the base of collaborative knowledge 
and established an interpretive framework fix>m which action research theories 
could evolve. By the 1970's, interest in collabordive action research was 
increasing (Co<^)er, Barrett, Hayhoe, Hotnxnig^ Rowe, & Rumsby, 1975; Elliott, 
1977) and the late sevoities saw the development of collaboative acticm research 
models and their application by Oja (1979, 1980), Pine (1979a, 1979b, 1980), 
Hod (1981), Ward & Tikunoff (1982), Smulyan (1984) , and othos. 

Additionally, reports and articles enqrfiasized the importance of involving teadiers 
in research as a way to link theory to practice (Huling, Trang, & Correll, 1981; 
S>ices, 1984). If teachers develop research skills, it was argued, then university 
researdiers would be more willing to wok collaboatively with public school 
people and as university researchers spend more time in public classrooms, both 
will be better able to create theory out of imictice as well asputtheoy into 
practice. The most current work in collaborative action research has been a 
development of the wok doie with {sreservice teadier educatioi programs 
(Beckman, 1957; Perrodin, 1959) and inservice teachers through projects initiated 
by 2^ichner(1993), Gore &2^ichner (1991), Sago(1991), McTaggart(1991), 
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NoflBce (1990), Oberg (1990), Schon (1990), Lucas (1988), Keimnis (1984), and 
other interesting projects in various stages of progress. 

Rationale for the Study 

TTie need for teachers to become researchers seems easy to affirm and it 
appears to make good sense, but educatcHs are often concerned aboutjust what is 
meant by the dictum that teachers should be researchers in their own classroom 
(Anning, 1986). Many experioiced teachers have nev^ learned research 
techmques or developed questions through reflecti(m on their own teaching 
practices, nor have they any eiqienence collecting, analyzing, or pres enting 
Specific activities that increase teadiers' reflections <mi their teaching and their 
participation in inquiry have been the focus of teacher research in recent years 
(Kelly, 1985; Sdion, 1990). Collaboration has been consistently used as a 
of bridgii^ the gap betwem the autonomous environments of public educatkm and 
university research to provide both groups with needed information. 

In a changing, ctmqilicated society, inter-relaticmships aiiKMig once 
independent sub^Dups grow as orga niz at i ons find themselv^ ecologically bound 
to each other (Pine & KeaiK, 1986). Collaborative research is essentially joint 
endeavors of uitonomous units to adiieve outcomes desired by all. These 
mitcomes, however, are often beyond the grasp of either single group ac ting alcme. 
Collab(»^(m is a ftieoretical partnership, but one in vriiich the collaborators can be 
informally organized and quickly modified ot dissolved as circumstances change. 

In collaborative action research, each educational canqi shares energy, expertise, 
time, and (rfh^ resources to plan programs of preservice and inservice education. 
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perscMial and group researdi, curriculum develc^mieiit, and sdiool improvement 
Action researdi in these specific forms exhibits a wide range of variation because 
there are many possible **discourses'' of action and therefore many ways in whidi 
acti(m research can be thecxrized and practiced (Kosmidou & Usher, 1991). It is the 
existence of these into’-related the<mzati(His and fx'actioes diat has led to the 
increased activity in the field during the last two decades. As more and m(x% 
researdi has been cratored in the clasooom, it is <mly natural to incocp(Htite the 
resources available. The increased stress on "applied* questions, on "relevance," 
and on the "servicing" function of impiiry have been a welcomed change in the 
research literature (Nixon, 1986). 

Mediodological hnplications 

The increased interest in collaborative action researdi has also been due, in 
part, to die concurroit ^wth in the use of qualitative, field-based methods in 
educational researdi (Hovda & Kyle, 1989). These methods sedc. participants' 
unctei^tanding of classroom intouctimis and dep^ upcm descripticm ami 
interpretation through loi^-term assodations in specific settings. Aimed at 
develc^ing an understanding of (XHistnicted realities, these tedmiques lend 
thenxselves well to the {diilosqdiies and goals of action researdi (Bassey, 1986). 
Collalxxrative action researdi does not require sophisticated methods, ikh- expert 
tedmical researdieni. "Contamination "offiie data is not a concern. Ifthe 
involvement of the coHaboratcxs "affects" the outc(Mne in the direction desired, then 
the research has been successful. The purpose is not to measure and [xedict, but 
rather to effect diange. Ahhou^ intentional, systematic, and as thorough as any 
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research, the process is tncneanergeiit than prescribed, with the direcdcm of the 
research often "emerging'' fixnn the process itself. As wifti the following study, the 
researchm are oftm {n'actitioiMs inquiring into their own classrooms with the hdp 
and support of collabOTation with higher education perscnmel. The research grew 
out of my questions, as a classoxxnn teacher, about my own inactice. Iwantedto 
solve problems I was experiencing as a new teacher and sou^ help fircnn 
university inofesstKs during graduate studies. Ahhou^ the researdi is both 
(H^cdtioner and practice centered (Hustler, Cassidy, & Cufi^ 1986), the questions 
and resulting descriptions and interpretations may be relevant to contexts other than 
the particular one investigated and may create new questicms in the process 
(Codnran-Smith & Lytle, 1990). It is through Hus process of questitming, pl anning , 
activatii^ reflecting, and re-questioning that we as researdiers develop 
coll^xHutively, broaden our understanding of >^hat is being transferred to the 
student, and create an envinmment which facilitates this transfer. 

Developing the Question 

As a new teacher I found myself facing about a himdred bright-eyed science 
studoits each day (xmiplete with my lesson plans, roll bo<^ discipline charts, 
'liands on" laboratory activities, specimens, lab equifxnent, textbodcs, overhead 
tranqiarencies, tardy slips, hall passes, and the growing pile <^"classw(Hk" papers 
on viiidi to give "feedback." When it came time to evaluate the students with the 
first diapter test, I caiuKrt express the joy with whidi I discovered the exams 
inovided by the textbodc conq>any that could be madiine scored. They were 
complete with a great variety of muttiple-dioice, true-false, and matching questions 
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that thorou^y covered the material in the chapter. They even provided review 
sheets that could be used in my review lesson. All I had to do was to use the 
"bubble in" cards, patrol the isles to prevent co<^)erative test-taking, and tun the 
cards through the machine. The answers were corrected and grades provided that 
could be easily transferred to my grade book. I could even go over the test with tlM 
students and review the areas in which they had problems. What a great system, 
who said teachers idiould "reinvent the wheel" and make up their own tests anyway? 

It was about die end of the second duqiter diat I noticed a few flaws in the 
system. Several of the better studoits, who knew the material very well, did not 
score well on the test Also, several of the poor students, wiio did not know the 
matoial at all, did (XMisidoablybettOT (Ml the test than I expected. Finally, I wanted 
to add s(Mne enrichment lesscMis to die next diapter and I didn't know how I was 
going to inc(Mp(Mate the material on the next test I to(dc my jMxiblems to a grachiate 
educati(Mi class I was taking and was directed to journal articles <mi altemative and 
multiple assessm^ whi(di gave me several i^w ideas, but little jMactical 
applicati(ML I was fortunate, then, to be directed to a series of professors who 
w(Miced with me fcx* sevanl years in the iqiplicaticMi and ^dy of these ideas as 
(xiUabcMative acticMi research. The soluti(Mi of my (diapter test (xroblem throu^ die 
use of essay (piesticMis is the basis of this study. 

Advantages and T .imitations of Essay Questions 

Essay (]uesti(Mis pmnit teadiers to evaluate learning not measurable by 
objective tests (Gronlund, 1985). True-false and matching items measure (Mily the 
simplest levels of expected learning outcxMnes, multiple (^ice items can measure 
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(xmq)rdi^i(m and |m>blem solving, but essay itons can evaluate more COTiplex 
skills (Wiersma & Juts, 1985). Essay qu^ons diallenge students to use their 
oiticai thinking skills, lequiringthrai to compare, justify, contrast, c«iq)ile, 
interpret, or finmulate valid conclusions - all higher-order skills (Ck^er, Kolstad, & 
Sosa, 1988). Responding actequately to essay questicms is mcne difBcult than 
di^laying passive knowledge on madiine-scored items (Oronlund, 1985), but 
allows the student the oppCHtunity to di^lay the iMToadest range of cognitive skills. 
The use of essay questions enables the teadier to better differentiate students who 
have studied the material and j^q>ared fen* the test, fiptxn those who have nc^ They 
also allow teadiers to evaluate organizing skills, reasoning abilities, individual 
motivatiem, and language arts abilities of stuctoits. Finally, essay itons can be used 
to evaluate the effectiveness of enrichment material created by die teadier that is 
not covered by othor parts of the exam. Some (xoblems with essay questions 
include the difficulty of evaluatii^ answers in a fair, equitable, consistent manner 
and the fact that they are nKMre time ctxisumii^ to |Miq>are and grade. Limitations 
should be considered, but most teadiers can easily overcome these difficulties and 
there are plmty of helpiiil suggestions in the literature (Gaffiiey, 1992; Tuckman, 
1991; Bizzell & Singleton, 1988; Bladcey, 1988; Coker, Kolstad, & Sosa, 1988). 
Develomnent of the Study 

Armed with my new knowledge of assessment, I discussed the application of 
essay questiems to my diapter test problem and a study of the affects of the 
applicati(m with the university {xofessors. We came up with the researdi question: 
How does the addition of essay questions to elective teds affect student scores? 
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We felt that this questicm would give us some directim toward our larg^ goal of 
inq)roving the effectiveness of my ctuqjter tests and h<^)ed diat continued direction 
would emerge frran the data. 

During the two years of the study I tau^ four classes of "coUege-bound** 
science students (average and above average) and four classes of "vocaticmal” 
science students (average and below average). During any (me term, I would have 
two classes in which I used essay cpiestions and two classes in vdiich I did not 
Additionally, I would have one class before lunch and cme class after lunch 
participate so that I could (^eck. ftn* time of day variaticMis. Data collection 
consisted of test secures, student interviews, teacher observatiems, and my perscmal 
log. Test (flanges began in the seomd grading pericxl each year so that baseline 
data could be established A total of seventy-seven college bound students and 
ninety-seven vocaticmal students completed the study. 

The essay (juestions used were teacher made and were topic specific, but 
open to allowing students flexibility in e^qnessing what they had learned about the 
topic: Tell me about the steps involved in ...” or "Discuss \^hat you have learned 
in our lab activity with ... " or "Explain what you know about the tnocess of..." 
Pri(}r to grading the ejuestions, I outlined the areas of acceptable response and gave 
each sectiem a point value. The students were {prepared for the essay herns through 
samples e}q>lained in lecture, {nactice perfenmed as classwork, and discussion of a 
variety of student re^xmses. The number of essay cpiesticms used per test varied 
fit)m three to five, accounting for about twenty-five percent of the overall test score. 
I found that keeping the numbo* of (]uestions low helped keep the grading time 
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reasonable. On the tests wfaidi included essay questions, I reduced the number of 
olqective Hems, especially in the areas covered by the essays, so diat the overall 
time needed to (xnnplete the test was mH appreciably increased. 

By adding essay questions to the textbodc supplied tests in this way, I was 
able to maintain the advant^es of a madiine-sc<Hed objective te^ include 
enrichment lessons, allow students to express >^iat they had learned, and compare 
these results to evaluations without essay que^mis cov^ing the same tc^ics. In 
addHi(»i, by starting the year with objective <mly tests, I was able to better 
detmnine the overall effect of adding essay questions. The main disadvantage was 
the increased time in [neparing and scoring the essay Hems and I feH die time was 
well spoit in return for tl^ benefits and knowledge gained. 

Results 

I could detect no tinM of day variations and, as expected, the classes i^ch 
took the same textbook supplied objective tests dirougbout the year did not change 
much in their average scores or in their attitudes about testing. They did increase 
sli^Hly as the year progressed and probably because, as caie student said in an 
interview, **I got used to tlK kinds of questicms asked <m the test and figured out 
what to study." 

The surprise came in the difference between the college-bound groups and 
the vocational groups. In the college-bound group, the average test scores went up 
almost ten p^cent overall, wHh nine studoHs ino:easing for every <me that 
decreased. Teadier observations by myself and others indicated an increased 
interest in studying for tests and increased motivation for learning the material 



Essay Questions 
12 

during class time. My log was full of c<Mnm«rts like, **Sue asked me to review the 
process in full so she (xnild get it right in her notes" and "Bob and Brian were 
drilling eadi other about the steps in frfiotosynthesis before the test" Studeirt 
interviews included many positive responses as well, "I like being able to write 
about what I know" and "1 can tell you what I studied " or "studyii^ for those 
[essay questicms] helped me on the other questions." 

The vocaticxial gitHip, however, was anotl^ sdoiy. The overall average test 
scores for this group decreased bv over ten percent with five students decreasing 
f(MT every <xie that increased. To add to the problem, the numb^ of students in the 
"failing" categcxy increased by almost twenty percent Teacher observations 
showed very few positive entries and a number of negative ones like, "Stacey imt 
studying for tests at all anymore" and "Paul's interest in science is decreasing." 
Student interviews shed a little light on the problem with responses like, "I just 
dont write well" and "I cant explain this stuff in writing" or "I dont know where to 
begin" or "I nev^ can remember all the steps." Even giving partial credit for 
partial answers did not seem to encourage these students to try to answer questioos 
with w^di they were not familiar. As a result, many questions were left 
conq>letely unanswered. 

Discussi<Mi and Implications fiy Future Researdi 

Naturally, the negative results from the vocational group led to new 
questimis ccmceming how to maintain the advantages of using essay questions and 
avoid the detrimental affect on the grades of average and below average students. 

4 

Special study strategies and ahemate assessm^ techniques were tried with these 
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studoits durii^ the seccmd year of the study and will be discussed in hiture pap^s, 
but the overall effect of the addition of essay questions was not altered Therefore, 
as we todc anotl^ locdc at essay questions, ^^lat we discovered in this case was 
that for teadiers, above average students, and some average students (especially 
those seddng higher education) essay questions had a variety of positive effects. 
Teachers could evaluate material they taught not covered in the textbook, better 
detaminevlut and how stuctentswope studying in preparati(Mifc«’ the tests, learn 
more ^x>ut individual motivations and attitudes, and spend a reascmable amount of 
time inreparii^ and grading tests. Students could better express wiiat they had been 
studying, learn material as cohesive units, and maintain a positive attitude about 
the coursework and testing. 

for below average and some average students (especially those not seridng 
hitler educaticm), some a<yustnients or ccxnpensations might be needed to 
counteract negative affects on their grades. The problem will be magnified when 
these students are found in heterogeneous classes where no distiiKtion is made 
between vocational and college-bound students, wfaidi is the current trend in 
educati<»L Therefixre, more researdi is recrnmnended in the use of essay questions 
widi below average students. 

As ateachor, I was impressed with how much I learned throu^ the process 

of collaborative action research and diallenge other teadiers to devote some time to 

this eiilighteniiigf<Mrm of development Thejustificationofeducatimial researdi is 

the extent to \diicfa it helps transform educational [xactice in sdiools (Kemmis, 

« 

1984). Coilalxx^ve action research per se is of little use if it does not inqnuve 
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our ability, as teachers, to transfer knowledge with undostanding to our students. 
However, if we, as a result of our involvement in coUabcnative acticm researdi, 
begin to reflect critically our own professional acticms and belie&, then teacher 
research becomes teadier development (Carr & Kemmis, 1983). Through this 
fMXKess of self-c(Hiscious scrutiny, we can the<»ize chit practice, revise our theOTies 
in light of reflective eTqperieiKe, and transform our practice into informed changes in 
(Mir b^vicn*. Imt it great that the ones w1k> benefit the most will be our students? 
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